Skip to content

NicheNet Analysis

Author: SeekGene
Time: 7 min
Words: 1.3k words
Updated: 2026-01-26
Reads: 0 times
SeekSoul™ Online

Preface

IMPORTANT

NicheNet is an analytical tool that can infer ligand-target gene regulation in receptor cells from expression data and prior knowledge, suitable for cell communication studies in single-cell and spatial transcriptomics data.

This training document is for bioinformatics analysts and biologists using the SeekSoul™ Online platform, aiming to introduce NicheNet's principles, common parameter settings, result interpretation, and operation key points on the platform.


NicheNet Theoretical Basis

Core Assumptions

NicheNet is based on the following assumptions: Ligands bind to receptors, trigger intracellular signal transduction in receptor cells, and ultimately affect transcription factors and regulate the expression of specific target genes. Therefore, to infer ligand regulation of target genes, NicheNet integrates ligand-receptor, signal transduction, and transcriptional regulatory networks into a prior model.

Data Integration

NicheNet primarily integrates:

  1. Ligand-receptor interaction databases (known L-R pairs)
  2. Protein-protein interaction and signal transduction networks (receptor → signaling proteins → TFs)
  3. Transcription factor-target gene regulatory networks

This information is integrated into weighted networks, and network propagation algorithms are used to calculate each ligand's regulatory potential score to candidate target genes.

NOTE

The network propagation algorithm is the core innovation of NicheNet. It simulates signal propagation strength attenuation and multi-pathway synergistic effects in the network, thereby deriving the regulatory potential of ligands on target genes.

Scoring and Statistical Evaluation

Common metrics: AUROC, AUPR, AUPR_corrected (for unbalanced data correction), Pearson correlation coefficient. NicheNet typically uses aupr_corrected as the main basis for ligand activity ranking.


Technical Implementation

Algorithm Workflow (Overview)

  1. Data preprocessing: Expression matrix normalization, cell annotation, and quality control
  2. Differential gene analysis: Identify interested target gene sets in receptor cells
  3. Ligand candidate selection: Screen potential ligands based on expression and biological priors
  4. Ligand activity analysis: Calculate and rank ligand regulatory potential on target gene sets based on network propagation
  5. Target gene prediction: Predict high-confidence target genes for TOP ligands and visualize

Differences Between Single-Cell and Spatial Data

Data TypePreprocessing FocusAnalysis Recommendations
Single-cell RNA-seqCell clustering and annotation, batch correctionUse cell type groups as sender/receiver units
Spatial transcriptomicsPreserve spatial coordinates, section quality controlOnly consider spatially neighboring cells as senders/receivers to reduce false positives

WARNING

Spatial data must consider cell proximity. Direct communication between distant cells is less likely, avoid blindly using all cells as senders.

Parameter Recommendations (Empirical Rules)

  • Expression threshold: Genes expressed in >=10% of cells in the cell type (can be reduced to 5-8% for spatial data)
  • Differential gene filtering: log2FC 0.25–1.0, p < 0.05, pct > 0.1
  • Target gene number: 50–500 differential genes recommended to balance signal and noise

TIP

For rare cells or low-coverage data, expression thresholds can be appropriately relaxed or downsampling strategies can be adopted.


SeekSoul™ Online Operation Guide

Parameter Configuration Interface (Overview)

Main Parameter Explanations

  1. Task Name: Starts with English letters, can include underscores and Chinese, with semantic meaning recommended (e.g., nichenet_analysis_2025)
  2. Group.by: Cell annotation column name (e.g., celltype or CellAnnotation)
  3. Ligand Cells (Senders): Multiple selection, 2–5 cell types with secretory functions recommended
  4. Receptor Cells (Receivers): Multiple selection, 1–3 target cell types of interest recommended
  5. Group.by: Column name for processing group/control group (e.g., Condition)
  6. Treat Name vs Control Name: Setting for differential gene analysis
  7. Species: Select human or mouse, affecting prior models and gene symbols

Parameter Filling Recommendations (Platform Practice)

IMPORTANT

Ligand cell selection strategy: Prioritize cells known to have secretory functions (immune cells, stromal cells, fibroblasts, etc.), avoid including non-nucleated cells such as red blood cells.

IMPORTANT

Receptor cell selection strategy: Select target cell types of interest (such as tumor cells, epithelial cells), and ensure sufficient sample size in group comparisons.

Submission and Viewing: Usually takes 30–60 minutes after submission (depending on data scale). After completion, reports and image files can be downloaded from the analysis list.


Result Interpretation

Main Output Files

  • *_ligand_activities.txt: Ligand activity scores (aupr_corrected as the main ranking basis)
  • *_top_ligands.txt: Top ligand list (usually Top30)
  • *_top_targets.txt: Predicted target gene list
  • Visualization files: *_Top_ligands_dot.png, *_ligands_heatmap.png, *_ligand_target_heatmap.png, *_ligand_receptor_heatmap.png, *_ligand_target_circos.png

Ligand Activity Table Interpretation

Column NameMeaning
test_ligandLigand gene name
aurocModel AUROC score
auprAUPR score
aupr_correctedAUPR corrected for unbalanced data (primary sorting basis)
pearsonPearson correlation between prediction and observation
rankRank based on aupr_corrected

NOTE

aupr_corrected is more suitable for single-cell scenarios, prioritizing this indicator for ligand priority determination.

Key Points for Interpreting Common Charts

  • Bubble chart (ligand expression): Point size represents expression proportion in cells, color represents expression level.
  • Ligand differential expression heatmap: Shows differential expression (log2FC) of treatment/control groups in each cell type.
  • Ligand-target gene regulation heatmap: Darker colors indicate stronger predicted regulatory potential.
  • Ligand-receptor heatmap: Used to verify biological rationality and screen possible ligand-receptor pairs.
  • Circos plot: Shows predicted connections between ligands and target genes, line thickness reflects regulation strength.

Screening Strategies (Experience)

  • Preferred: Ligands with high aupr_corrected and expression in ligand cells and whose receptors are expressed in receptor cells
  • Combine biological background and literature, prioritize ligands with relevant functions and verifiable through experiments

Application Cases (Example Summary)

Case: Renal Cell Carcinoma (RCC) Tumor-Microenvironment Interaction Analysis (Brief)

  • Problem: Identify cell communication signals promoting EMT at the tumor-non-tumor boundary
  • Ligand Cells: Macrophage
  • Receptor Cells: Tumor cells

Key Findings: Ligands expressed by macrophages (such as IL1B) have significant regulatory potential on EMT target genes in tumor cells, and the receptor IL1R1 is highly expressed in tumor cells, suggesting the IL1B/IL1R1 pathway may be involved in tumor invasion.

TIP

In case analysis, it is recommended to combine spatial location and immune phenotype to verify prediction results, improving conclusion credibility.


Precautions and Best Practices

Data and Quality Control

WARNING

Cell number requirements: At least 50 cells per cell type recommended; can be appropriately relaxed for rare cells.

WARNING

Gene expression requirements: Ligand and receptor genes need to be expressed in corresponding cell types, otherwise predictions may be false positives.

Parameters and Result Screening

  • Ligand activity screening: aupr_corrected > 0.1 as a reference threshold; auxiliary reference auroc > 0.6
  • Target gene screening: Prioritize genes with high regulatory potential and relevant functions, combined with enrichment analysis to judge biological consistency

Common Misunderstandings

WARNING

Avoid choosing too many ligand/receptor cell combinations, as too many combinations will increase noise and reduce interpretability.

WARNING

NicheNet is a predictive tool, all key conclusions should be combined with literature and experimental verification.


Frequently Asked Questions

Q1: How to select ligand/receptor cells?

A: Ligand cells should prioritize cells with known secretory functions (immune cells, stromal cells, etc.), receptor cells are target cells of research interest, and both should possibly interact spatially.

Q2: What if ligand activity is low?

A: Check ligand/receptor expression, relax differential gene thresholds, or consider other cell combinations.

Q3: How to judge prediction reliability?

A: Combine aupr_corrected, ligand/receptor expression evidence, GO/KEGG enrichment, and literature/experimental verification.


References

  • Browaeys R, Saelens W, Saeys Y. NicheNet: modeling intercellular communication by linking ligands to target genes. Nat Methods. 2020;17(2):159-162.
  • Efremova M, Vento-Tormo M, Teichmann SA, Vento-Tormo R. CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes. Nat Protoc. 2020.
0 comments·0 replies